Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 242.577
Filtrar
1.
Sci Rep ; 14(1): 7749, 2024 04 02.
Artigo em Inglês | MEDLINE | ID: mdl-38565703

RESUMO

DPP4 inhibitors can control glucose homeostasis by increasing the level of GLP-1 incretins hormone due to dipeptidase mimicking. Despite the potent effects of DPP4 inhibitors, these compounds cause unwanted toxicity attributable to their effect on other enzymes. As a result, it seems essential to find novel and DPP4 selective compounds. In this study, we introduce a potent and selective DPP4 inhibitor via structure-based virtual screening, molecular docking, molecular dynamics simulation, MM/PBSA calculations, DFT analysis, and ADMET profile. The screened compounds based on similarity with FDA-approved DPP4 inhibitors were docked towards the DPP4 enzyme. The compound with the highest docking score, ZINC000003015356, was selected. For further considerations, molecular docking studies were performed on selected ligands and FDA-approved drugs for DPP8 and DPP9 enzymes. Molecular dynamics simulation was run during 200 ns and the analysis of RMSD, RMSF, Rg, PCA, and hydrogen bonding were performed. The MD outputs showed stability of the ligand-protein complex compared to available drugs in the market. The total free binding energy obtained for the proposed DPP4 inhibitor was more negative than its co-crystal ligand (N7F). ZINC000003015356 confirmed the role of the five Lipinski rule and also, have low toxicity parameter according to properties. Finally, DFT calculations indicated that this compound is sufficiently soft.


Assuntos
Inibidores da Dipeptidil Peptidase IV , Simulação de Dinâmica Molecular , Inibidores da Dipeptidil Peptidase IV/farmacologia , Simulação de Acoplamento Molecular , Sítios de Ligação , Dipeptidil Peptidase 4 , Teoria da Densidade Funcional , Ligantes
2.
Methods Mol Biol ; 2797: 67-90, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38570453

RESUMO

Molecular docking is a popular computational tool in drug discovery. Leveraging structural information, docking software predicts binding poses of small molecules to cavities on the surfaces of proteins. Virtual screening for ligand discovery is a useful application of docking software. In this chapter, using the enigmatic KRAS protein as an example system, we endeavor to teach the reader about best practices for performing molecular docking with UCSF DOCK. We discuss methods for virtual screening and docking molecules on KRAS. We present the following six points to optimize our docking setup for prosecuting a virtual screen: protein structure choice, pocket selection, optimization of the scoring function, modification of sampling spheres and sampling procedures, choosing an appropriate portion of chemical space to dock, and the choice of which top scoring molecules to pick for purchase.


Assuntos
Algoritmos , Proteínas Proto-Oncogênicas p21(ras) , Simulação de Acoplamento Molecular , Proteínas Proto-Oncogênicas p21(ras)/genética , Proteínas Proto-Oncogênicas p21(ras)/metabolismo , Software , Proteínas/química , Descoberta de Drogas , Ligantes , Ligação Proteica , Sítios de Ligação
3.
Methods Mol Biol ; 2797: 115-124, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38570456

RESUMO

Fragment-based screening by ligand-observed 1D NMR and binding interface mapping by protein-observed 2D NMR are popular methods used in drug discovery. These methods allow researchers to detect compound binding over a wide range of affinities and offer a simultaneous assessment of solubility, purity, and chemical formula accuracy of the target compounds and the 15N-labeled protein when examined by 1D and 2D NMR, respectively. These methods can be applied for screening fragment binding to the active (GMPPNP-bound) and inactive (GDP-bound) states of oncogenic KRAS mutants.


Assuntos
Descoberta de Drogas , Proteínas Proto-Oncogênicas p21(ras) , Proteínas Proto-Oncogênicas p21(ras)/genética , Ligantes , Espectroscopia de Ressonância Magnética , Proteínas , Ligação Proteica , Sítios de Ligação
4.
Elife ; 122024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38567819

RESUMO

Based on experimentally determined average inter-origin distances of ~100 kb, DNA replication initiates from ~50,000 origins on human chromosomes in each cell cycle. The origins are believed to be specified by binding of factors like the origin recognition complex (ORC) or CTCF or other features like G-quadruplexes. We have performed an integrative analysis of 113 genome-wide human origin profiles (from five different techniques) and five ORC-binding profiles to critically evaluate whether the most reproducible origins are specified by these features. Out of ~7.5 million union origins identified by all datasets, only 0.27% (20,250 shared origins) were reproducibly obtained in at least 20 independent SNS-seq datasets and contained in initiation zones identified by each of three other techniques, suggesting extensive variability in origin usage and identification. Also, 21% of the shared origins overlap with transcriptional promoters, posing a conundrum. Although the shared origins overlap more than union origins with constitutive CTCF-binding sites, G-quadruplex sites, and activating histone marks, these overlaps are comparable or less than that of known transcription start sites, so that these features could be enriched in origins because of the overlap of origins with epigenetically open, promoter-like sequences. Only 6.4% of the 20,250 shared origins were within 1 kb from any of the ~13,000 reproducible ORC-binding sites in human cancer cells, and only 4.5% were within 1 kb of the ~11,000 union MCM2-7-binding sites in contrast to the nearly 100% overlap in the two comparisons in the yeast, Saccharomyces cerevisiae. Thus, in human cancer cell lines, replication origins appear to be specified by highly variable stochastic events dependent on the high epigenetic accessibility around promoters, without extensive overlap between the most reproducible origins and currently known ORC- or MCM-binding sites.


Assuntos
Complexo de Reconhecimento de Origem , Proteínas de Saccharomyces cerevisiae , Humanos , Complexo de Reconhecimento de Origem/genética , Complexo de Reconhecimento de Origem/metabolismo , Origem de Replicação/genética , Sítios de Ligação , Replicação do DNA/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Cromossomos Humanos/metabolismo , DNA/metabolismo , Proteínas de Ciclo Celular/metabolismo
5.
BMC Bioinformatics ; 25(1): 156, 2024 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-38641811

RESUMO

BACKGROUND: Accurately identifying drug-target interaction (DTI), affinity (DTA), and binding sites (DTS) is crucial for drug screening, repositioning, and design, as well as for understanding the functions of target. Although there are a few online platforms based on deep learning for drug-target interaction, affinity, and binding sites identification, there is currently no integrated online platforms for all three aspects. RESULTS: Our solution, the novel integrated online platform Drug-Online, has been developed to facilitate drug screening, target identification, and understanding the functions of target in a progressive manner of "interaction-affinity-binding sites". Drug-Online platform consists of three parts: the first part uses the drug-target interaction identification method MGraphDTA, based on graph neural networks (GNN) and convolutional neural networks (CNN), to identify whether there is a drug-target interaction. If an interaction is identified, the second part employs the drug-target affinity identification method MMDTA, also based on GNN and CNN, to calculate the strength of drug-target interaction, i.e., affinity. Finally, the third part identifies drug-target binding sites, i.e., pockets. The method pt-lm-gnn used in this part is also based on GNN. CONCLUSIONS: Drug-Online is a reliable online platform that integrates drug-target interaction, affinity, and binding sites identification. It is freely available via the Internet at http://39.106.7.26:8000/Drug-Online/ .


Assuntos
Aprendizado Profundo , Interações Medicamentosas , Sítios de Ligação , Sistemas de Liberação de Medicamentos , Avaliação Pré-Clínica de Medicamentos
6.
Genome Biol ; 25(1): 83, 2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38566111

RESUMO

BACKGROUND: The rise of large-scale multi-species genome sequencing projects promises to shed new light on how genomes encode gene regulatory instructions. To this end, new algorithms are needed that can leverage conservation to capture regulatory elements while accounting for their evolution. RESULTS: Here, we introduce species-aware DNA language models, which we trained on more than 800 species spanning over 500 million years of evolution. Investigating their ability to predict masked nucleotides from context, we show that DNA language models distinguish transcription factor and RNA-binding protein motifs from background non-coding sequence. Owing to their flexibility, DNA language models capture conserved regulatory elements over much further evolutionary distances than sequence alignment would allow. Remarkably, DNA language models reconstruct motif instances bound in vivo better than unbound ones and account for the evolution of motif sequences and their positional constraints, showing that these models capture functional high-order sequence and evolutionary context. We further show that species-aware training yields improved sequence representations for endogenous and MPRA-based gene expression prediction, as well as motif discovery. CONCLUSIONS: Collectively, these results demonstrate that species-aware DNA language models are a powerful, flexible, and scalable tool to integrate information from large compendia of highly diverged genomes.


Assuntos
DNA , Sequências Reguladoras de Ácido Nucleico , Sítios de Ligação , Alinhamento de Sequência , Algoritmos , Sequência Conservada/genética , Evolução Molecular
7.
Sci Rep ; 14(1): 9155, 2024 04 21.
Artigo em Inglês | MEDLINE | ID: mdl-38644393

RESUMO

Deep learning models (DLMs) have gained importance in predicting, detecting, translating, and classifying a diversity of inputs. In bioinformatics, DLMs have been used to predict protein structures, transcription factor-binding sites, and promoters. In this work, we propose a hybrid model to identify transcription factors (TFs) among prokaryotic and eukaryotic protein sequences, named Deep Regulation (DeepReg) model. Two architectures were used in the DL model: a convolutional neural network (CNN), and a bidirectional long-short-term memory (BiLSTM). DeepReg reached a precision of 0.99, a recall of 0.97, and an F1-score of 0.98. The quality of our predictions, the bias-variance trade-off approach, and the characterization of new TF predictions were evaluated and compared against those produced by DeepTFactor, as well as against experimental data from three model organisms. Predictions based on our DLM tended to exhibit less variance and bias than those from DeepTFactor, thus increasing reliability and decreasing overfitting.


Assuntos
Aprendizado Profundo , Fatores de Transcrição , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Biologia Computacional/métodos , Células Procarióticas/metabolismo , Redes Neurais de Computação , Eucariotos/genética , Genoma , Células Eucarióticas/metabolismo , Sítios de Ligação
8.
Int J Mol Sci ; 25(7)2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38612757

RESUMO

Wildtype Escherichia coli cells cannot grow on L-1,2-propanediol, as the fucAO operon within the fucose (fuc) regulon is thought to be silent in the absence of L-fucose. Little information is available concerning the transcriptional regulation of this operon. Here, we first confirm that fucAO operon expression is highly inducible by fucose and is primarily attributable to the upstream operon promoter, while the fucO promoter within the 3'-end of fucA is weak and uninducible. Using 5'RACE, we identify the actual transcriptional start site (TSS) of the main fucAO operon promoter, refuting the originally proposed TSS. Several lines of evidence are provided showing that the fucAO locus is within a transcriptionally repressed region on the chromosome. Operon activation is dependent on FucR and Crp but not SrsR. Two Crp-cAMP binding sites previously found in the regulatory region are validated, where the upstream site plays a more critical role than the downstream site in operon activation. Furthermore, two FucR binding sites are identified, where the downstream site near the first Crp site is more important than the upstream site. Operon transcription relies on Crp-cAMP to a greater degree than on FucR. Our data strongly suggest that FucR mainly functions to facilitate the binding of Crp to its upstream site, which in turn activates the fucAO promoter by efficiently recruiting RNA polymerase.


Assuntos
Escherichia coli , Fucose , Sítios de Ligação , Escherichia coli/genética , Óperon/genética , Fosforilação
9.
J Chem Theory Comput ; 20(8): 2985-2991, 2024 Apr 23.
Artigo em Inglês | MEDLINE | ID: mdl-38602504

RESUMO

The Protein Structure Transformer (PeSTo), a geometric transformer, has exhibited exceptional performance in predicting protein-protein binding interfaces and distinguishing interfaces with nucleic acids, lipids, small molecules, and ions. In this study, we introduce PeSTo-Carbs, an extension of PeSTo specifically engineered to predict protein-carbohydrate binding interfaces. We evaluate the performance of this approach using independent test sets and compare them with those of previous methods. Furthermore, we highlight the model's capability to specialize in predicting interfaces involving cyclodextrins, a biologically and pharmaceutically significant class of carbohydrates. Our method consistently achieves remarkable accuracy despite the scarcity of available structural data for cyclodextrins.


Assuntos
Carboidratos , Aprendizado Profundo , Ligação Proteica , Proteínas , Proteínas/química , Proteínas/metabolismo , Carboidratos/química , Sítios de Ligação
10.
J Chem Inf Model ; 64(8): 3465-3476, 2024 Apr 22.
Artigo em Inglês | MEDLINE | ID: mdl-38602938

RESUMO

Many biological functions are mediated by large complexes formed by multiple proteins and other cellular macromolecules. Recent progress in experimental structure determination, as well as in integrative modeling and protein structure prediction using deep learning approaches, has resulted in a rapid increase in the number of solved multiprotein assemblies. However, the assembly process of large complexes from their components is much less well-studied. We introduce a rapid computational structure-based (SB) model, GoCa, that allows to follow the assembly process of large multiprotein complexes based on a known native structure. Beyond existing SB Go̅-type models, it distinguishes between intra- and intersubunit interactions, allowing us to include coupled folding and binding. It accounts automatically for the permutation of identical subunits in a complex and allows the definition of multiple minima (native) structures in the case of proteins that undergo global transitions during assembly. The model is successfully tested on several multiprotein complexes. The source code of the GoCa program including a tutorial is publicly available on Github: https://github.com/ZachariasLab/GoCa. We also provide a web source that allows users to quickly generate the necessary input files for a GoCa simulation: https://goca.t38webservices.nat.tum.de.


Assuntos
Conformação Proteica , Proteínas , Proteínas/química , Proteínas/metabolismo , Sítios de Ligação , Modelos Moleculares , Software , Complexos Multiproteicos/química , Complexos Multiproteicos/metabolismo
11.
Nat Commun ; 15(1): 3146, 2024 Apr 11.
Artigo em Inglês | MEDLINE | ID: mdl-38605029

RESUMO

Despite their lack of a defined 3D structure, intrinsically disordered regions (IDRs) of proteins play important biological roles. Many IDRs contain short linear motifs (SLiMs) that mediate protein-protein interactions (PPIs), which can be regulated by post-translational modifications like phosphorylation. 20% of pathogenic missense mutations are found in IDRs, and understanding how such mutations affect PPIs is essential for unraveling disease mechanisms. Here, we employ peptide-based interaction proteomics to investigate 36 disease-associated mutations affecting phosphorylation sites. Our results unveil significant differences in interactomes between phosphorylated and non-phosphorylated peptides, often due to disrupted phosphorylation-dependent SLiMs. We focused on a mutation of a serine phosphorylation site in the transcription factor GATAD1, which causes dilated cardiomyopathy. We find that this phosphorylation site mediates interaction with 14-3-3 family proteins. Follow-up experiments reveal the structural basis of this interaction and suggest that 14-3-3 binding affects GATAD1 nucleocytoplasmic transport by masking a nuclear localisation signal. Our results demonstrate that pathogenic mutations of human phosphorylation sites can significantly impact protein-protein interactions, offering insights into potential molecular mechanisms underlying pathogenesis.


Assuntos
Proteínas Intrinsicamente Desordenadas , Peptídeos , Humanos , Fosforilação , Peptídeos/metabolismo , Processamento de Proteína Pós-Traducional , Regulação da Expressão Gênica , Mutação , Proteínas Intrinsicamente Desordenadas/metabolismo , Ligação Proteica , Sítios de Ligação , Proteínas do Olho/genética
12.
Nat Commun ; 15(1): 3186, 2024 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-38622114

RESUMO

Transcription termination factor ρ is a hexameric, RNA-dependent NTPase that can adopt active closed-ring and inactive open-ring conformations. The Sm-like protein Rof, a homolog of the RNA chaperone Hfq, inhibits ρ-dependent termination in vivo but recapitulation of this activity in vitro has proven difficult and the precise mode of Rof action is presently unknown. Here, our cryo-EM structures of ρ-Rof and ρ-RNA complexes show that Rof undergoes pronounced conformational changes to bind ρ at the protomer interfaces, undercutting ρ conformational dynamics associated with ring closure and occluding extended primary RNA-binding sites that are also part of interfaces between ρ and RNA polymerase. Consistently, Rof impedes ρ ring closure, ρ-RNA interactions and ρ association with transcription elongation complexes. Structure-guided mutagenesis coupled with functional assays confirms that the observed ρ-Rof interface is required for Rof-mediated inhibition of cell growth and ρ-termination in vitro. Bioinformatic analyses reveal that Rof is restricted to Pseudomonadota and that the ρ-Rof interface is conserved. Genomic contexts of rof differ between Enterobacteriaceae and Vibrionaceae, suggesting distinct modes of Rof regulation. We hypothesize that Rof and other cellular anti-terminators silence ρ under diverse, but yet to be identified, stress conditions when unrestrained transcription termination by ρ may be detrimental.


Assuntos
Fator Rho , Fatores de Transcrição , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Fator Rho/química , Transcrição Gênica , RNA/genética , Sítios de Ligação , Regulação Bacteriana da Expressão Gênica , RNA Bacteriano/genética
13.
Phys Chem Chem Phys ; 26(16): 12880-12891, 2024 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-38625412

RESUMO

Protein-ligand binding affinity prediction plays an important role in the field of drug discovery. Existing deep learning-based approaches have significantly improved the efficiency of protein-ligand binding affinity prediction through their excellent inductive bias capability. However, these methods only focus on fragmented three-dimensional data, which truncates the integrity of pocket data, leading to the neglect of potential long-range interactions. In this paper, we propose a dual-stream framework, with amino acid sequence assisting the atomic data fusion for graph neural network (termed SadNet), to fuse both 3D atomic data and sequence data for more accurate prediction results. In detail, SadNet consists of a pocket module and a sequence module. The sequence module expands the "receptive field" of the pocket module through a mid-term virtual node fusion. To better integrate sequence-level information from the sequence module and 3D structural information from the pocket module, we incorporate structural information for each amino acid within the sequence module. Besides, to better understand the intrinsic relationship between sequences and 3D atomic information, our SadNet utilizes information stacking from both the early stage and later stage. Experimental results on publicly available benchmark datasets demonstrate the superiority of the proposed dual-stream approach over the state-of-the-art alternatives. The code of this work is available online at https://github.com/wardhong/SadNet.


Assuntos
Proteínas , Ligantes , Proteínas/química , Proteínas/metabolismo , Redes Neurais de Computação , Ligação Proteica , Aprendizado Profundo , Sequência de Aminoácidos , Sítios de Ligação
14.
Proc Natl Acad Sci U S A ; 121(18): e2319384121, 2024 Apr 30.
Artigo em Inglês | MEDLINE | ID: mdl-38652746

RESUMO

Clearance of serotonin (5-hydroxytryptamine, 5-HT) from the synaptic cleft after neuronal signaling is mediated by serotonin transporter (SERT), which couples this process to the movement of a Na+ ion down its chemical gradient. After release of 5-HT and Na+ into the cytoplasm, the transporter faces a rate-limiting challenge of resetting its conformation to be primed again for 5-HT and Na+ binding. Early studies of vesicles containing native SERT revealed that K+ gradients can provide an additional driving force, via K+ antiport. Moreover, under appropriate conditions, a H+ ion can replace K+. Intracellular K+ accelerates the resetting step. Structural studies of SERT have identified two binding sites for Na+ ions, but the K+ site remains enigmatic. Here, we show that K+ antiport can drive substrate accumulation into vesicles containing SERT extracted from a heterologous expression system, allowing us to study the residues responsible for K+ binding. To identify candidate binding residues, we examine many cation binding configurations using molecular dynamics simulations, predicting that K+ binds to the so-called Na2 site. Site-directed mutagenesis of residues in this site can eliminate the ability of both K+ and H+ to drive 5-HT accumulation into vesicles and, in patch clamp recordings, prevent the acceleration of turnover rates and the formation of a channel-like state by K+ or H+. In conclusion, the Na2 site plays a pivotal role in orchestrating the sequential binding of Na+ and then K+ (or H+) ions to facilitate 5-HT uptake in SERT.


Assuntos
Simulação de Dinâmica Molecular , Potássio , Proteínas da Membrana Plasmática de Transporte de Serotonina , Sódio , Proteínas da Membrana Plasmática de Transporte de Serotonina/metabolismo , Proteínas da Membrana Plasmática de Transporte de Serotonina/química , Proteínas da Membrana Plasmática de Transporte de Serotonina/genética , Potássio/metabolismo , Sítios de Ligação , Humanos , Sódio/metabolismo , Serotonina/metabolismo , Ligação Proteica , Animais
15.
J Med Chem ; 67(8): 6384-6396, 2024 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-38574272

RESUMO

Peptide deformylase (PDF) is involved in bacterial protein maturation processes. Originating from the interest in a new antibiotic, tremendous effort was put into the refinement of PDF inhibitors (PDFIs) and their selectivity. We obtained a full NMR backbone assignment the emergent additional protein backbone resonances of ecPDF 1-147 in complex with 2-(5-bromo-1H-indol-3-yl)-N-hydroxyacetamide (2), a potential new structural scaffold for more selective PDFIs. We also determined the complex crystal structures of E. coli PDF (ecPDF fl) and 2. Our structure suggests an alternative ligand conformation within the protein, a possible starting point for further selectivity optimization. The orientation of the second ligand conformation in the crystal structure points toward a small region of the S1' pocket, which differs between bacterial PDFs and human PDF. Moreover, we analyzed the binding mode of 2 via NMR TITAN line shape analysis, revealing an induced fit mechanism.


Assuntos
Amidoidrolases , Antibacterianos , Escherichia coli , Amidoidrolases/antagonistas & inibidores , Amidoidrolases/metabolismo , Amidoidrolases/química , Antibacterianos/farmacologia , Antibacterianos/química , Escherichia coli/enzimologia , Escherichia coli/efeitos dos fármacos , Cristalografia por Raios X , Sítios de Ligação , Inibidores Enzimáticos/química , Inibidores Enzimáticos/farmacologia , Modelos Moleculares , Humanos , Relação Estrutura-Atividade
16.
Biochemistry ; 63(8): 1038-1050, 2024 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-38577885

RESUMO

The ethylene-forming enzyme (EFE) is an Fe(II), 2-oxoglutarate (2OG), and l-arginine (l-Arg)-dependent oxygenase that either forms ethylene and three CO2/bicarbonate from 2OG or couples the decarboxylation of 2OG to C5 hydroxylation of l-Arg. l-Arg binds with C5 toward the metal center, causing 2OG to change from monodentate to chelate metal interaction and OD1 to OD2 switch of D191 metal coordination. We applied anaerobic UV-visible spectroscopy, X-ray crystallography, and computational approaches to three EFE systems with high-resolution structures. The ineffective l-Arg analogue l-canavanine binds to the EFE with O5 pointing away from the metal center while promoting chelate formation by 2OG but fails to switch the D191 metal coordination from OD1 to OD2. Substituting alanine for R171 that interacts with 2OG and l-Arg inactivates the protein, prevents metal chelation by 2OG, and weakens l-Arg binding. The R171A EFE had electron density at the 2OG binding site that was identified by mass spectrometry as benzoic acid. The substitution by alanine of Y306 in the EFE, a residue 12 Å away from the catalytic metal center, generates an interior cavity that leads to multiple local and distal structural changes that reduce l-Arg binding and significantly reduce the enzyme activity. Flexibility analyses revealed correlated and anticorrelated motions in each system, with important distinctions from the wild-type enzyme. In combination, the results are congruent with the currently proposed enzyme mechanism, reinforce the importance of metal coordination by OD2 of D191, and highlight the importance of the second coordination sphere and longer range interactions in promoting EFE activity.


Assuntos
Canavanina , Compostos Ferrosos , Liases , Compostos Ferrosos/metabolismo , Sítios de Ligação , Alanina , Ácidos Cetoglutáricos/metabolismo
17.
Bioinformatics ; 40(4)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38561176

RESUMO

MOTIVATION: Understanding the intermolecular interactions of ligand-target pairs is key to guiding the optimization of drug research on cancers, which can greatly mitigate overburden workloads for wet labs. Several improved computational methods have been introduced and exhibit promising performance for these identification tasks, but some pitfalls restrict their practical applications: (i) first, existing methods do not sufficiently consider how multigranular molecule representations influence interaction patterns between proteins and compounds; and (ii) second, existing methods seldom explicitly model the binding sites when an interaction occurs to enable better prediction and interpretation, which may lead to unexpected obstacles to biological researchers. RESULTS: To address these issues, we here present DrugMGR, a deep multigranular drug representation model capable of predicting binding affinities and regions for each ligand-target pair. We conduct consistent experiments on three benchmark datasets using existing methods and introduce a new specific dataset to better validate the prediction of binding sites. For practical application, target-specific compound identification tasks are also carried out to validate the capability of real-world compound screen. Moreover, the visualization of some practical interaction scenarios provides interpretable insights from the results of the predictions. The proposed DrugMGR achieves excellent overall performance in these datasets, exhibiting its advantages and merits against state-of-the-art methods. Thus, the downstream task of DrugMGR can be fine-tuned for identifying the potential compounds that target proteins for clinical treatment. AVAILABILITY AND IMPLEMENTATION: https://github.com/lixiaokun2020/DrugMGR.


Assuntos
Proteínas , Ligantes , Proteínas/química , Sítios de Ligação
18.
J Chem Theory Comput ; 20(8): 3335-3348, 2024 Apr 23.
Artigo em Inglês | MEDLINE | ID: mdl-38563746

RESUMO

Protein-protein interactions mediate most molecular processes in the cell, offering a significant opportunity to expand the set of known druggable targets. Unfortunately, targeting these interactions can be challenging due to their typically flat and featureless interaction surfaces, which often change as the complex forms. Such surface changes may reveal hidden (cryptic) druggable pockets. Here, we analyze a set of well-characterized protein-protein interactions harboring cryptic pockets and investigate the predictive power of current computational methods. Based on our observations, we developed a new computational strategy, SWISH-X (SWISH Expanded), which combines the established cryptic pocket identification capabilities of SWISH with the rapid temperature range exploration of OPES MultiThermal. SWISH-X is able to reliably identify cryptic pockets at protein-protein interfaces while retaining its predictive power for revealing cryptic pockets in isolated proteins, such as TEM-1 ß-lactamase.


Assuntos
Proteínas , beta-Lactamases , beta-Lactamases/química , beta-Lactamases/metabolismo , Proteínas/química , Proteínas/metabolismo , Ligação Proteica , Sítios de Ligação , Simulação de Dinâmica Molecular
19.
Int J Mol Sci ; 25(7)2024 Mar 31.
Artigo em Inglês | MEDLINE | ID: mdl-38612736

RESUMO

The discovery of new genes with novel functions is a major driver of adaptive evolutionary innovation in plants. Especially in woody plants, due to genome expansion, new genes evolve to regulate the processes of growth and development. In this study, we characterized the unique VeA transcription factor family in Populus alba × Populus glandulosa, which is associated with secondary metabolism. Twenty VeA genes were characterized systematically on their phylogeny, genomic distribution, gene structure and conserved motif, promoter binding site, and expression profiling. Furthermore, through ChIP-qPCR, Y1H, and effector-reporter assays, it was demonstrated that PagMYB128 directly regulated PagVeA3 to influence the biosynthesis of secondary metabolites. These results provide a basis for further elucidating the function of VeAs gene in poplar and its genetic regulation mechanism.


Assuntos
Populus , Fatores de Transcrição , Fatores de Transcrição/genética , Populus/genética , Genômica , Sítios de Ligação , Bioensaio
20.
BMC Bioinformatics ; 25(1): 158, 2024 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-38643066

RESUMO

BACKGROUND: Motif finding in Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) data is essential to reveal the intricacies of transcription factor binding sites (TFBSs) and their pivotal roles in gene regulation. Deep learning technologies including convolutional neural networks (CNNs) and graph neural networks (GNNs), have achieved success in finding ATAC-seq motifs. However, CNN-based methods are limited by the fixed width of the convolutional kernel, which makes it difficult to find multiple transcription factor binding sites with different lengths. GNN-based methods has the limitation of using the edge weight information directly, makes it difficult to aggregate the neighboring nodes' information more efficiently when representing node embedding. RESULTS: To address this challenge, we developed a novel graph attention network framework named MMGAT, which employs an attention mechanism to adjust the attention coefficients among different nodes. And then MMGAT finds multiple ATAC-seq motifs based on the attention coefficients of sequence nodes and k-mer nodes as well as the coexisting probability of k-mers. Our approach achieved better performance on the human ATAC-seq datasets compared to existing tools, as evidenced the highest scores on the precision, recall, F1_score, ACC, AUC, and PRC metrics, as well as finding 389 higher quality motifs. To validate the performance of MMGAT in predicting TFBSs and finding motifs on more datasets, we enlarged the number of the human ATAC-seq datasets to 180 and newly integrated 80 mouse ATAC-seq datasets for multi-species experimental validation. Specifically on the mouse ATAC-seq dataset, MMGAT also achieved the highest scores on six metrics and found 356 higher-quality motifs. To facilitate researchers in utilizing MMGAT, we have also developed a user-friendly web server named MMGAT-S that hosts the MMGAT method and ATAC-seq motif finding results. CONCLUSIONS: The advanced methodology MMGAT provides a robust tool for finding ATAC-seq motifs, and the comprehensive server MMGAT-S makes a significant contribution to genomics research. The open-source code of MMGAT can be found at https://github.com/xiaotianr/MMGAT , and MMGAT-S is freely available at https://www.mmgraphws.com/MMGAT-S/ .


Assuntos
Sequenciamento de Cromatina por Imunoprecipitação , Genômica , Humanos , Animais , Camundongos , Sítios de Ligação , Ligação Proteica , Genômica/métodos , Cromatina/genética , Fatores de Transcrição/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...